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Abstract 

A panoramic image has 360° horizontal field of view, and it can provide the viewer the impres- 
sion of being immersed in the scene to a certain extent. A panorama is created by first taking a 
sequence of images while rotating the camera about a vertical axis. These images are then projected 
onto a cylindrical surface before being seamlessly composited. The cross-sectional circumference 
of the cylindrical panorama is called the compositing length. 

This work characterizes the error in compositing panoramic images due to errors in intrinsic 
camera parameters. The intrinsic camera parameters that are considered are the camera focal 
length and the radial distortion coefficient. We show that the error in the compositing length is 
more sensitive to the error in the camera focal length. Especially important is the discovery that 
the relative error in compositing length is always smaller than the relative error in the focal length. 
This means that the error in focal length can be corrected by iteratively using the composited length 
to compute a new and more correct focal length. This compositing approach to camera calibration 
has the advantages of not requiring both feature detection and separate prior calibration. 

Keywords: 3-D scene modeling, image compositing, camera calibration. 
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1 Introduction 

A panoramic image has a 360° horizontal field of view. Panoramic images of scenes have interest- 
ing applications in computer vision and visualization, since they are able to provide the viewer the 
impression of being immersed in the scene to a certain extent. For example, Apple's QuickTime 
VR™ [Chen, 1995] product uses panoramas for scene visualization. In computer vision, applying 
the stereo algorithm on multiple panoramas allows the entire 3D scene data points to be extracted 
[Kang and Szeliski, 1995] and subsequently modeled [Kang etal, 1995]. 

A panoramic image is produced by following a series of steps: First, a sequence of images is 
taken while rotating a camera about a vertical axis that passes through the camera optical center. 
Each image in the sequence is then projected onto a cylindrical surface whose cross-sectional 
radius is an initially estimated focal length (see Figure 1). The panoramic image is subsequently 
created by determining the relative displacements between adjacent images in the sequence and 
compositing the displaced sequence of images. The length of the panoramic image is termed the 
compositing length. 

1.1 Analyzing the error in compositing length 

In this technical report, we describe the effect of errors in intrinsic camera parameters on the 
compositing length. We are not aware of any prior work in this specific area. In particular, we 
consider the focal length and the radial distortion coefficient. An important consequence of this 
analysis is that a much better estimate of the camera focal length can be calculated from the current 
compositing length. Hence by iterating the process of projecting onto a cylindrical surface (whose 
cross-sectional radius is the latest estimation of the camera focal length) and compositing the new 
sequences, we quickly arrive at the camera focal length within a specified error tolerance. We show 
later that the convergence towards the true focal length is exponential. This method constitutes a 
simple means of calibrating a camera. 

1.2 Camera calibration 

Conventional means of camera calibration use a calibration or control pattern (e.g., points [Szeliski 
and Kang, 1994; Tsai, 1987; Weng et al, 1992], lines [Beardsley and Murray, 1992; Caprile 
and Torre, 1990; Wang and Tsai, 1991]), or take advantage of feature structural constraints (e.g., 
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roundness of spheres [Penna, 1991; Stein, 1995], straightness of lines [Brown, 1971; Stein, 1995]). 
We propose a method for featureless camera calibration based on an iterative scheme of projecting 
rectilinear images to cylindrical images and then compositing. In addition to determining the 
camera focal length, this technique results in a panoramic image (i.e., with 360° horizontal field of 
view) that is both physically correct (ignoring radial distortion effects) and seamlessly blended. 

The basis of the compositing approach to camera calibration is the discovery that the relative 
compositing length error due to camera focal length error is disproportionately much less (i.e., in 
terms of percentages) than the relative focal length error. When a planar image is projected to a 
cylinder as in the compositing process, mis-estimation of the radius of the cylinder will produce an 
erroneous warping of the image that will affect the length of the final composite image. However, 
it turns out that near the center of the overlap between successive images, the amount of combined 
warping for both images is minimal and so is the effect of the mis-estimation. The result of this 
is that the percent error in length of the composite image is less than the percent error in the focal 
length. This makes it possible to use the composite length to recover a better estimate of the camera 
focal length. 

The proposed technique has the advantage of not having to know the camera focal length when 
a panorama is to be generated from a sequence of images. This is in contrast to Apple's QuickTime 
VR™ [Chen, 1995], which we believe that a reasonably good estimate of camera focal length is 
required a priori. This is also the case for McMillan and Bishop's method of creating panoramas 
[McMillan and Bishop, 1995]. Their method of estimating the camera focal length necessitates 
small panning rotations and relies on translation estimates near the image centers. 

The method of calibration that is closest to ours is that of Stein's [Stein, 1995], in which 
features are tracked throughout the image sequence taken while the camera is rotated a full 360°. 
While this technique results in accurate camera parameters, it still requires feature detection and 
tracking. Our technique directly uses the given image sequence of the scene to determine camera 
focal length without relying on specific tracked features. 

1.3 Motivation and outline 

The motivation for generating panoramic images is to directly recover 3-D scene data points over a 
wide field of view using stereo [Kang and Szeliski, 1995] for subsequent modeling and photorealis- 
tic rendering [Kang et al, 1995]. Traditional approaches to recovering 3-D data of a wide scene is 
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Figure 1: Compositing multiple rotated camera views into a panorama. The ' x ' marks indicate the 
locations of the camera optical and rotation center. 



to take stereo snapshots of the scene at various poses and then merge these 3-D stereo depth maps. 
This is not only computationally intensive, but the resulting merged depth maps may be subject 
to merging errors, especially if the relative poses between depth maps are not known exactly. The 
3-D data may also have to be resampled before merging, which adds additional complexity and 
potential sources of errors. 

The outline of this technical report is as follows: Section 2 reviews how a panoramic image is 
produced from a set of images. This is followed by section 3 which gives a detailed analysis of the 
compositing error due error in the camera focal length in Section 3. A consequence of this analysis 
is the iterative compositing approach to camera calibration. Section 4 looks at the effect of mises- 
timating the radial distortion coefficient on the panoramic compositing length. We then describe 
the effect of errors in both focal length and radial distortion coefficient on the reconstructed 3-D 
data in section 5 before summarizing in section 6. 



2 Generating a panoramic image 

A panoramic image is created by compositing a series of rotated camera image images, as shown 
in Figure 1 . In order to create this panoramic image, we first have to ensure that the camera is 
rotating about an axis passing through its optical center, i.e., we must eliminate motion parallax 
when panning the camera around. To achieve this, we manually adjust the position of camera 
relative to an X-Y precision stage (mounted on the tripod) such that the motion parallax effect 
disappears when the camera is rotated back and forth about the vertical axis [Stein, 1995]. 

In previous work [Kang and Szeliski, 1995], the camera is first calibrated to extract its intrinsic 
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parameters, namely k, the radial distortion coefficient, and /, the camera focal length. This is 
accomplished by taking snapshots of a calibration dot grid pattern at known spacings and using the 
iterative least squares algorithm described in [Szeliski and Kang, 1994]. As a result of our analysis 
reported here, this calibration step can be skipped if the radial distortion coefficient is insignificant. 
A panoramic image is created using the following steps [Kang and Szeliski, 1995]: 

1 . Capture a sequence of rotated camera views about a vertical axis passing through the camera 
optical center; 

2. Undistort the sequence to correct for k; 

3. Warp the undistorted (rectilinear) sequence to produce a corresponding cylindrical-based 
image sequence whose cross-sectional radius is equal to the camera focal length /; and 
finally 

4. Composite the sequence of images [Szeliski, 1994]. 

The compositing technique comprises two steps: rough alignment using phase correlation, and 
iterative local refinement to minimize overlap intensity difference between successive images. In 
both steps, the translation is assumed to be in one direction only, namely in the x-direction (since 
the cylindrical-based images have been "flattened" or unrolled). This is a perfectly legitimate 
assumption, since camera motion has been constrained to rotate about a vertical axis during image 
sequence capture. If the estimated focal length is exact, then the error in the composited length is 
due to the digitization and image resampling effects, the limit in the number of iterations during 
local matching, and computer truncation or rounding off effects. 

The rough alignment step has been made more robust by adding the iterative step of checking 
the displacement corresponding to the peak — if the intensity RMS error in matching the overlap 
regions is high, the peak is tagged as false and the next highest peak is chosen instead. 

3 Compositing errors due to misestimation of focal length 

Compositing errors occur as a result of using a wrong value of the camera focal length in converting 
the rectilinear images to cylindrical-based images prior to compositing. If the correct focal length 
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used, say f true , then the expected length of the composited panoramic image 1 is 



L = 27Tf t 



true 



(1) 



If the focal length / used is incorrect, then the mapped cylindrical images are no longer phys- 
ically correct. The compositing step will attempt to minimize the error in the overlap region of 
successive images, but there is still a net error in compositing length L. 

Since each column in a rectilinear image is projected to another column in the cylindrical 
image and translation is constrained to be along the x-direction, it suffices to consider only a 
scanline in our analysis. We assume for simplicity that the images are "fully textured" so that 
matching between corresponding pixels is unambiguous. We also assume, for ease of analysis, 
that the amount of camera rotation between successive frames is the same throughout the sequence 
(this need not be so in practice). In our analysis, the net translation is computed by minimizing 
the sum of squares of the pixel displacements from their matching locations. In other words, even 
after translation with interpolation, the pixels in the second image will not match the pixels in the 
first image at the same location. Each pixel will match one at a displaced location. The translation 
which minimizes the sum of their squares is the one that results in zero average displacement. 

3.1 Derivation 

In order to model the displacement of each pixel u t in the second cylindrical image, we map it 
back to U in the image plane, find the corresponding pixel s t in the first image based on the actual 
rotation a, and map that back to v t in the cylindrical image. This is illustrated in Fig. 2, where 
h and I 2 are the first and second cylindrical images, respectively, hj rU e is the true cylindrical 
first image while a is the amount of actual camera rotation between successive frames, i.e., 2ir/N, 
N being the number of images in the sequence. Recall that the cylindrical images are formed by 
warping the images into a cylindrical surface whose cross-sectional radius is the estimated focal 
length. The mappings are given by the following equations: 



1 In creating the panoramic image, the order of compositing is h, h, In -it In, h, where Ik is the kth image in 
the sequence and N is the number of images in the sequence. The compositing length L is actually the displacement 
of the first frame I\ relative to its original location. 



t{ = f tan 
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(c) 

Figure 2: Effect of inexact focal length: (a) actual mapping from second cylindrical image; (b) 
theoretical displacement; (c) actual mapping to first cylindrical image. See text. 

Si = ftme tan ^tan -1 ^ + ( 2 ) 
Vi = /tan" 1 (^j 

As before, a = 2ir/N and f true is the correct focal length while / is the estimated focal length 
used. 

Using Mathematica™ [Wolfram, 1991], we find that, up to the third order in it; and a, 

P P _L_ f2 2 _ X2 2 
/ \ I J J true ' J a i J truest 
Vi (it;) = U t H — a 



true 
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Figure 3: Graph of error in displacement vs. pixel location for varying estimated focal length /. 

jtme = 274.5, N = 50, and I = 232. 

+ TfU « (3) 

°J J true 

The displacement between two successive frames at u 8 is o? 8 (u 8 ) = Vi (it;) — it;; the plot of the 
variation of c/;(u;) versus it; for / true = 274.5 and N = 50 for different values of misestimated 
values of / is shown in Fig. 3. It is interesting to note that the minimum displacement due to 
focal length error occurs near the center of the overlap. The change in the displacement error 
distribution due to the amount of overlap (changing number of frames N) is shown in Fig. 4. As 
can be observed, as N increases, the amount of overlap increases, and interestingly enough, the 
overall error decreases. If all are kept constant except for the image length /, the error distribution 
remains the same save for ui and u 2 , the two end pixel locations of the overlap area. They shrink 
to decrease the amount of horizontal overlap as / decreases. The mean displacement between two 
successive frames (see Fig. 5) is 

u 2 — u t + 1 

Note that if the interframe displacement is equal throughout the sequence and that the length of 
each image is /, then u 2 = 1/2 and u x = 2irf true /N — 1/2. The mean total displacement, i.e., the 
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Figure 4: Graph of error in displacement vs. pixel location for varying number of frames N. 

jtme = 274.5, / = 294.5 and I = 232. 



composite length, is given by 



L = ND 



(5) 



Suppose we reestimate the focal length from the composite length, namely /' = Lj (2ir). The 
question is: Is /' a better estimate of the focal length than /? It turns out that for initial estimates 
of / close to the true value, the answer can be shown to be yes. To see this, if / ~ f trU e, then from 
(3), (4) and (5), we get 
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Figure 5: Overlap between successive images and I k , with centers at Ok-i and Ok respec- 
tively. 
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noting that a = 2ir/N, and where 



E 



"2 



^2—^1 + 1 



1 / , \ ^ ftrue 
-{U 2 + Ul ) = —jy—; 



(7) 



if 



^2 — ^1 + 1 
1 



1 



- ^uj + U X U 2 + u\ + - (lt 2 
I [ ^VLe _ f 2^ 

3 N 2 \ N 



Ui 



^ ( (u 2 + «i) 2 



+ 



AT 



f 
2 



(8) 



and 



«2 



E 



^2 — ^1 + 1 



="1 



10 



3 Compositing errors due to misestimation of focal length 



i (u 2 + u x ) (u 2 2 + u\ + u 2 



Ui 



1 7f ftrue 

2 AT 



'27T/, 



true 



N 



l\ I 2 ^ ftrue 

2 I AT 



(9) 



Hence 



true 



r 



+ 



ftrue 
ftrue 



+ 



u--( 5^ rue 



f 2 / f 2 

J / J true 



a 



Ji 



true 



P 3; f t , 



a 



'f 



2 

true 



ftrue ftrue ~T~ ,/* f 

"7 2T - ' 



+ 



it 



2 u 3 



true 



Ji 



true 



3/; 



'3 

true , 



a + 



1 2 u 2 



C2 

' true 



fx 



2 

true 



+ 



it 



2 u 3 \ 1 u 2 

"7^r + 3 1_ V - 

' true / V «/ true . 



Let 



ft 



ir 



p 

J true 



and 



ft 



f 3 

J true 



1 

~ 3 

1 7T 

2iV 



4^ 
A^ 



3/1 

ft 
/2tt 



v 3 3/ 

« 2 f (/true - /) 



a 2 f (/true - /) 



ftrue 

I 



IT 

AT 



2^ 



2 ftrue J ^ftrue 

1 y f , * 



+ 



1 



7T 



ftrue \ 2 ftrue t 

2 f TT I 



+ 



(10) 

(11) 
(12) 

(13) 



(14) 



N 2 ftrue J \2 ft rue J ftrue \ A~ 2 ftrue : 
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ftrue ~ f ~ 2 \fh + (£1 - -ft J a + - (1 - 2/3 2 ) a 2 j (f true - f) = rj (/* rue - 

If A" is large, which is typical (in practice, A" is about 50), then \f trU e — f'\ "C |/t r « e — /I- This 
implies that the estimated focal length based on the composited length is a significantly better 
estimate. 
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3.2 Image compositing approach to camera calibration 

The previous result suggests a direct, iterative method of simultaneously determining the camera 
focal length and constructing a panoramic image. This iterative image compositing approach to 
camera calibration has the advantages of not requiring both feature detection and separate prior 
calibration. The pseudocode associated with this method is as follows: 
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Let the initial estimate of focal length be fo . 
Determine compositing length L 0 from f 0 . 
Set k = 1 . 

1. Calculate fk = Lk-\/ (2ir) . 

2. Determine compositing length Lk from fk. 

3. If (\L k - L k -i.\ > e) { 

k <r- k + 1 

Go to Step 1 . 

} 

else 

fk is the final estimated focal length. 

Since we know that the iterated value of f k converges toward f trU e, it would be interesting 
to determine its rate of convergence. (14) can be rewritten as a recurrence equation (assuming 
equality rather than approximation) 

ftrue ~ fk = f] (ftrue ~ fk-l) (15) 

Rearranging, we have 

fk ~ ifk-i = (1 - rj) f true , (16) 
from which the solution can be found to be 

fk = ftrue + (fo — ftrue) (17) 

Hence, the convergence of fk towards f true is exponential in the vicinity of the true focal length, 
as shown by (17). This also indicates that the convergence is faster if the number of frames N 
increases, the image length / decreases, or the true focal length f true increases. As an example, for 
N = 50,1 = 232, and f true = 274.5, r/ = 0.117. 

The graph in Figure 6 shows the convergence of estimated focal length from different initial 
estimates. (A sequence of the synthetic room is shown in Figure 7 and the corresponding compos- 
ited image is shown in Figure 8.) It is interesting to note that the actual estimated focal lengths are 
smaller than theoretically predicted ones. One of the reasons could be due to effects of resampling 
using bilinear interpolation. In addition, we also make the assumption that each point is "fully 
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Figure 6: Graph of estimated focal length vs. number of iteration. fO is the initial estimated focal 
length; the actual focal length is 274.5. The solid lines represent actual values whereas dashed 
lines represent predicted values. 

textured," which is difficult to realize in practice and even in simulations. Finally, shifts greater 
than 1 pixel are not likely to influence the net shift correctly. 

A panorama of the synthetic room is shown in Figure 9. As can be seen, the effect of misesti- 
mating the focal length in compositing is a blurring effect, presumably about the correct locations. 
When a rectilinear image is projected onto a cylindrical surface of the wrong cross- sectional ra- 
dius, which is also the estimated focal length, the error in pixel placement increases away from the 
central image column. Having many cylindrical-converted images superimposed would thus have 
the effect of locally smearing the correct locations. This suggests that a good scheme of composit- 
ing many images to form a panorama is to down-weight the pixels away from the central image 
column during compositing. Indeed, Figure 10 shows the effect of using such a simple scheme. 
Here each pixel is weighted by a factor proportional to |c — c center |^ , where c is the current pixel 
column, c center the central pixel column, and £ = — 5. This yields a panorama that visually appears 
almost as good as that shown in Figure 8. Note, however, that the panorama in Figure 10 is still 
not quite physically correct; the aspect ratio is still not exact. 
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Image 1 Image 2 Image (N-l) Image N 

Figure 7: Example undistorted image sequence of synthetic room. 




Figure 8: Panorama of synthetic room after compositing the sequence in Figure 7. 



To further illustrate the robustness of this approach, we have also started the iterative process 
with the original rectilinear images, (i.e., f 0 = oo), which would be the worst case focal length 
initialization. The convergence of the focal length value is: oo — > 281.18 — > 274.40 — > 274.24. 
As before, the actual focal length is 274.5. The process arrives at virtually the correct focal length 
in just two iterations. This result is very significant; it illustrates that in principal, we can start 
without a focal length estimate. 




Figure 9: Panorama of synthetic room corresponding to an erroneous focal length. 
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Figure 10: Panorama of synthetic room corresponding to an erroneous focal length, but using a 
simple weighted compositing technique. 

4 Compositing errors due to misestimation of radial distortion 
coefficient 

Another important camera intrinsic parameter that could cause errors in the compositing length is 
the radial lens distortion. If (x u , y u ) is the undistorted image location and (x d} y d ) is its radially 
distorted counterpart, then 



x v 



x d (l + K X r 2 d + K 2 r A d + ...) 
y d (l + n x r\ + K 2 r 4 d + ...) 



where 



r d = \fx 2 d + y 2 d 

For our work, we use only the first radial coefficient term k = ki\ 

x u = x d (l + Kr 2 d ) 
y u = 2/d(l + Kr d ) 

with the inverse 



(18) 



(19) 



where 



x d 

yd 



1 + Krj 



1 + Krj 



1 r, 
+ 



2k' 



1 1 r l 
+ + — 

729k 6 27k 3 2k 2 



(20) 
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Figure 11: Mapping of pixels from the second cylindrical image to the first. The transformation 
/ indicates mapping from cylindrical to rectilinear coordinates with focal length / while trans- 
formation k indicates the radial distortion mapping with radial distortion factor k. Terms with the 
subscript "true" represent the correct entities while those without this subscript represent estimated 
ones. RectO is the undistorted rectilinear image. See text. 




1 2 
+ — ( 21 ) 

~5 \ 3 OK 

1 4. iL) 1 l _J_ 4. iL\ K 2 

,27k 3 ^ 2k 2 ) 729k 6 ^ 27k 3 ^ 2k 2 J ^ 

(21) is found using Mathematica™ [Wolfram, 1991]. Details of lens distortion modeling can be 
found in [Slama, 1980]. 

The transformations required to show the effect of incorrect focal length and radial distortion 
coefficient are depicted in Figure 11. We assume that the cylindrical images are displaced by 
an angular amount a. To see how these transformations come about, consider the right half of 
the series of transformations beyond "rotate by a." We require the mapping from the correct 
cylindrical image point to the actual cylindrical image point, given estimates of / and k. To 
generate the correct undistorted rectilinear image, we have to unproject from the cylindrical surface 
to the flat rectilinear surface (/^ e ) an d then radially undistort (ft^ij. Subsequently we perform 
radial distortion (k) and cylindrical projection (/) to arrive at the estimated cylindrical image. This 
is similarly done for the second image. Equations (3), (19), (20), and (21) are used in series to 
determine the theoretical displacement, as is similarly done in section 3. The difference is that in 
calculating the mean displacement, the displacements are averaged over all the pixels in the image. 
This is because radial distortion changes both x and y coordinates, while the cylindrical projection 
changes the x component independently of y. In addition, if the camera axis passes through the 
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Figure 12: Graph of equivalent focal length error vs. error in k, the radial distortion factor. The 
true focal length (f trU e) is 274.5 and the true radial distortion factor (n true ) is 2.8 x 10~ 7 . 

image center row, the average displacement in y is zero. 

The effect of misestimating the radial distortion coefficient k for a typical value of / = 274.5 
and k = 2.8 x 10~ 7 is shown in Figure 12. As can be seen, the effect is almost linear, and 
despite significant errors in k, the resulting error in the effective focal length is small (< 1%). This 
illustrates that for typical real focal lengths and radial distortion coefficients, the dominant factor 
in the compositing length error is the accuracy of the focal length. 

The appearance of the panorama due to error in radial distortion coefficient k is not very per- 
ceptible if the radial distortion is typically small (of the order of 10~ 7 ). An extreme case that 
corresponds to a large error in radial distortion coefficient (by 10~ 5 ) can be seen in Figure 13. 
Here, a simple scheme of compositing by direct averaging is performed, and there is a perceptible 
ghosting effect. However, using the weighted compositing scheme results in a much sharper im- 
age, as shown in Figure 14. There is still some blurring effects, which is more pronounced away 
from the central horizontal row of the panorama, but this is to be expected with errors in k. 

There are two ways of measuring compositing length error: mismatch between observed com- 
positing length and expected compositing length based on estimated focal length, and mismatch 
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Figure 13: Another panorama of synthetic room corresponding to a large erroneous radial distor- 
tion coefficient (by 1.0 x 10~ 5 ). 




Figure 14: Panorama of synthetic room (same camera location as in Figure 13) corresponding to 
an erroneous focal length. 

between the correct compositing length and expected compositing length. The first error (dLO) 
measures the consistency between the estimated focal length and the observed composite length. 
The second error (dLl) metric measures the error due to the current estimate of the focal length, 
and cannot be calculated unless the true focal length is known. Figure 15 shows the variation of 
both types of compositing length error as a function of errors in estimated focal length and radial 
distortion coefficient. (The nominal focal length and radial distortion coefficient are 274.5 and 
2.8 x 10~ 7 respectively.) The error in focal length is expressed as df = f — ftrue- The error dLO 
is 2irf — L, where L is the compositing length and / is the estimated focal length. This is rele- 
vant if the estimated focal length is assumed to be correct and the composited length is adjusted 
to be compatible with the estimated focal length. In this case, the image displacement errors are 
distributed over all the frames (the simpliest method being uniform distribution). This procedure 
involves the least amount of computation as the images do not require reprojection onto a cylin- 
drical surface of a difference cross-sectional radius (i.e., focal length). It may be used in the case 
of accurately estimated focal lengths. Meanwhile, the error dLl is 2n(f — ftrue), ftrue being the 
correct focal length. This is relevant in the case of using the newly estimated focal length based 
on the composited length. As can be observed, both types of compositing length errors are more 
sensitive to the error in the estimated focal length, with dLO much more so. 
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Figure 15: Variation of compositing length error vs. errors in both focal length and radial distortion 
coefficient. The deviations are all in terms of percentages. The nominal focal length and radial 
distortion coefficient are 274.5 and 2.8 x 10~ 7 respectively. See text for descriptions of dLO and 
dLl. 

5 Effect of error in focal length and radial distortion coefficient 
on 3-D data 

The recovered 3-D data do depend on the accuracy of the estimated focal length and radial distor- 
tion coefficient. This can be seen from Figures 17 and 18. The length and breadth of the synthetic 
room are 10 and 8 units respectively. Stereo data was recovered from 3 camera locations; two 
camera locations are 0.3a/2 units away from the first or reference camera location. An example 
of a distribution of recovered stereo data corresponding to the correct focal length of 274.5 and 
no radial distortion is shown in Figure 16. Surprisingly, despite the increased numerical errors, 
the recovered 3-D data corresponding to the other (erroneous) focal lengths and radial distortion 
coefficient do not appear significantly different from that shown in Figure 16. This suggests that if 
exact reconstruction is not required and that the panorama does not have to be of high quality, then 
just using the estimated focal length directly would suffice. 

From Figure 17, it can be observed that the effect of underestimating the focal length is greater 
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Figure 16: Example recovered 3-D data (corresponding to the correct focal length of 274.5). 

than overestimating it. This is most likely due to the greater relative change in the curvature 
error (the curvature of the cylindrical surface being inversely proportional to the cross-sectional 
radius, which is the focal length) in underestimating the focal length. In addition, the effect of 
misestimating the focal length appears to be more significant on the accuracy of the reconstructed 
3-D points than does misestimating the radial distortion coefficient (assuming typical values of k 
of the order of 10~ 7 ). This suggests that as long as the field of view is not too large as to result in 
significant radial distortion, we can get by with a simple estimation of k, or by assuming no radial 
distortion. 

6 Summary 

We have analyzed the compositing error in terms of two intrinsic camera parameters, namely the 
focal length and the radial distortion coefficient. Given typical values of the radial distortion coef- 
ficient, the effect of the focal length on the compositing error is more significant than that of the 
radial distortion coefficient. An important discovery from this analysis is that the relative com- 
positing length error due to camera focal length error is disproportionately much less (i.e., in terms 
of percentages) than the relative focal length error. This enables the use of the resulting com- 
positing length to recover a better estimate of the camera focal length, and forms the basis of the 
iterative compositing approach to camera calibration. This method has the advantage of not having 
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Figure 17: Graph of RMS 3D error of recovered stereo data vs. estimated focal length. The true 
focal length is 274.5 (indicated by the vertical dashed line). 
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Figure 18: Graph ofRMS 3D error of recovered stereo data vs. error in radial distortion coefficient 
k. Typical real values of k is of the order of 10" 7 . The zero error in k is indicated by the vertical 
dashed line. 



6 Summary 



21 



to know the camera focal length when a panorama is to be generated from a sequence of images. 
In addition, it does not rely on feature detection and tracking and on a separate prior calibration 
process. 

It has also been found that the resulting composite panorama is of a much higher visual quality 
if a weighted scheme in combining overlapping regions is used. Specifically, in blending images, 
we employ a weighting distribution of an exponential form that favors pixels closer to the central 
column of the image to which they belong. 
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